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ECO5830 2458 bp DNA linear BCT 08-MAR-2000 

Escherichia coli tatABCD operon. 

AJ005830 

AJ005830.1 GI:3123495 

mttB gene; mttC gene; Sec -independent protein translocase; tatA 
gene; TatA protein; tatB gene; TatB protein; tatC gene; TatC 
protein; tatD gene; TatD protein; yigU gene; yigW gene. 
Escherichia coli 
Escherichia coli 

Bacteria; Proteobacteria; Gammaproteobacteria; Enterobacteriales ; 

Enterobacteriaceae ; Escherichia . 

1 

Blattner,F.R. , Plunkett,G. Ill, Bloch,C.A., Perna,N.T., Burland,V., 
Riley, M. , Collado-Vides , J . , Glasner , J.D . , Rode,C.K., Mayhew,G.F., 
Gregor,J., Davis,N.W. , Kirkpatrick, H . A. , Goeden,M.A., Rose,D.J., 
Mau,B. and Shao,Y. 

The complete genome sequence of Escherichia coli K-12 

Science 277 (5331), 1453-1474 (1997) 

97426617 

9278503 



Sargent, F. 
Berks, B.C. 



gene 
CDS 



Bogsch,E.G., Stanley , N . R . , Wexler,M., Robinson, C, 
and Palmer, T." 

Overlapping functions of components of a bacterial Sec -independent 

protein export pathway 

EMBO J. 17 (13), 3640-3650 (1993) 

98315056 

9649434 

3 (bases 1 to 2458) 

Palmer, T. 

Direct Submission 

Submitted (30-APR-1998) Palmer T. , Nitrogen Fixation Laboratory, 
John Innes Centre, NORWICH, NR4 7UH, UNITED KINGDOM 
Related sequence: AE000459. 

Location/Qualifiers 

1. .2458 

/organism= "Escherichia coli" 

/mol_type= "genomic DNA" 

/strain="K-12" 

/ sub_s t rain= » MC4 100" 

/ db_xr e f = » t axon : 5 6 2 " 

56. .325 

/gene="tatA" 

56. .325 

/gene="tatA" 

/function=" Sec -independent protein translocase" 
/codon start=l 
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/ trans l_table= 11 
/product="TatA protein" 
/protein id=" CAA06724 . 1 " 
/db_xref ="GI :3123496" 
/db_xref =" SWISS- PROT: 06593 8" 

/ 1 r ans 1 at ion= " MGGISIWQLL I IAVI WLLFGTKKLGS IGSDLGAS IKGFKKAMS 

DDEPKQDKTSQDADFTAKTIADKQADTNQEQAKTEDAKRHDKEQV" 
gene 329 . . 844 

/gene="tatB" 
CDS 329. . 844 

/gene="tatB" 

/function= "Sec -independent protein translocase" 

/codon_start=l 

/transl table=ll 

/product="TatB protein" 

/protein id=" CAA06725 . 1 " 

/db_xref="GI:312 34 97" 

/db_xref = » SWISS - PROT : 06 94 15 " 

/ 1 rans 1 at ion- " riFD IGFSELLLVFI IGL v"v^GPQRLPVAVKTVAG 

TVQNELTQELKLQEFQDSLKKVEKASLTNLTPELKASMDELRQAAESMKRSYVANDPE 

KASDEAHTIHNPWKDNEAAHEGVTPAAAQTQASSPEQKPETTPEPWKPAADAEPKT 

AAPSPSSSDKP" 
conflict 380. .384 

/gene="tatB" 

/citation= [1] 

/replace= "gccgcct " 
gene 847 . . 1623 

/gene="tatC" 
CDS 847. . 1623 

/gene="tatC" 

/function=" Sec -independent protein translocase" 

/note="alternative gene names: yigU, mttB" 

/codon_start=l 

/transl table=ll 

/product="TatC protein" 

/protein id= " CAA06726 . 1 " 

/db_xref="GI : 3123498" 

/db_xref = " SWISS - PROT : P2 7 8 5 7 " 

/ trans lation= "MSVEDTQPLITHLIELRKRLLNCIIAVIVIFLCLVYFANDIYHL 
VSAPLIKQLPQGSTMIATDVASPFFTPIKLTFMVSLILSAPVILYQVWAFIAPALYKH 
ERRLWPLLVSSSLLFYIGMAFAYFVVFPLAFGFLANTAPEGVQVSTDIASYLSFVMA 
LFMAFGVSFEVPVAIVLLCWMGITSPEDLRKKRPYVLVGAFWGMLLTPPDVFSQTLL 
AIPMYCLFEIGVFFSRFYVGKGRNREEENDAEAESEKTEE" 

g ene 1653 . .2447 

/gene="tatD" 

CDS 1653 . .2447 

/gene="tatD" 

/f unction="Sec-independent protein translocase" 
/note="alternative gene names: yigW, mttC" 
/codon_start=l 
/transl_table=ll 
/product="TatD protein" 
/prot ein_id= " CAA06727.1 " 
- /db_xref="GI:3123499" 
/ db_xr e f = " S WI SS - PROT : P2 7 8 5 9 " 

/translations "MEYRMFDIGVNLTSSQFAKDRDDWACAFDAGVNGLLITGTNLR 
ESQQAQKLARQYSSCWSTAGVHPHDSSQWQAATEEAIIELAAQPEWAIGECGLDFNR 
NFSTPEEQERAFVAQLRIAADLNMPVFMHCRDAHERFMTLLEPWLDKLPGAVLHCFTG 
TREEMQACVAHGIYIGITGWVCDERRGLELRELLPLIPAEKLLIETDAPYLLPRDLTP 
KPSSRRNEPAHLPHILQRIAHWRGEDAAWLAATTDANVKTLFGIAF" 
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conflict 2204.. 2205 

/gene= M tatD" 

/citation= [1] 

/replace="acc" 
BASE COUNT 573 a 604 c 684 g 597 t 

ORIGIN 

1 cgtgtaacgt ataatgcggc tttgtttaat catcatctac cacagaggaa catgtatggg 
61 tggtatcagt atttggcagt tattgattat tgccgtcatc gttgtactgc tttttggcac 
121 caaaaagctc ggctccatcg gttccgatct tggtgcgtcg atcaaaggct ttaaaaaagc 
181 aatgagcgat gatgaaccaa agcaggataa aaccagtcag gatgctgatt ttactgcgaa 
241 aactatcgcc gataagcagg cggatacgaa tcaggaacag gctaaaacag aagacgcgaa 
301 gcgccacgat aaagagcagg tgtaatccgt gtttgatatc ggttttagcg aactgctatt 
361 ggtgttcatc atcggcctcg tcgttctggg gccgcaacga ctgcctgtgg cggtaaaaac 
421 ggtagcgggc tggattcgcg cgttgcgttc actggcgaca acggtgcaga acgaactgac 
481 ccaggagtta aaactccagg agtttcagga cagtctgaaa aaggttgaaa aggcgagcct 
541 cactaacctg acgcccgaac tgaaagcgtc gatggatgaa ctacgccagg ccgcggagtc 
601 gatgaagcgt tcctacgttg caaacgatcc tgaaaaggcg agcgatgaag cgcacaccat 
661 ccataacccg gtggtgaaag ataatgaagc tgcgcatgag ggcgtaacgc ctgccgctgc 
721 acaaacgcag gccagt tcgc eggaacagaa gccagaaacc acgccagagc cggtggtaaa 
781 acctgctgcg gacgctgaac cgaaaaccgc tgcaccttcc ccttcgtcga gtgataaacc 
841 gtaaacatgt ctgtagaaga tactcaaccg cttatcacgc atctgattga gctgcgtaag 
901 cgtctgctga actgcattat cgcggtgatc gtgatattcc tgtgtctggt ctatttcgcc 
961 aatgacatct atcacctggt atccgcgcca ttgatcaagc agttgccgca aggttcaacg 
1021 atgatcgcca ccgacgtggc ctcgccgttc tttacgccga tcaagctgac ctttatggtg 
1081 tcgctgattc tgtcagcgcc ggtgattctc tatcaggtgt gggcatttat cgccccagcg 
1141 ctgtataagc atgaacgtcg cctggtggtg ccgctgctgg tttccagctc tctgctgttt 
1201 tatatcggca tggcattcgc ctactttgtg gtctttccgc tggcatttgg cttccttgcc 
12 61 aataccgcgc cggaaggggt gcaggtatcc accgacatcg ccagctattt aagcttcgtt 
1321 atggcgctgt ttatggcgtt tggtgtctcc tttgaagtgc cggtagcaat tgtgctgctg 
1381 tgctggatgg ggattacctc gccagaagac ttacgcaaaa aacgcccgta tgtgctggtt 
1441 ggtgcattcg ttgtcgggat gttgctgacg ccgccggatg tcttctcgca aacgctgttg 
1501 gcgatcccga tgtactgtct gtttgaaatc ggtgtcttct tctcacgctt ttacgttggt 
1561 aaagggcgaa atcgggaaga ggaaaacgac gctgaagcag aaagcgaaaa aactgaagaa 
1621 taaattcaac cgcccgtcag ggcggttgtc atatggagta caggatgttt gatatcggcg 
1681 ttaatttgac cagttcgcaa tttgcgaaag accgtgatga tgttgtagcg tgcgcttttg 
1741 acgcgggagt taatgggcta ctcatcaccg gcactaacct gcgtgaaagc cagcaggcgc 
1801 aaaagctggc gcgtcagtat tcgtcctgtt ggtcaacggc gggcgtacat cctcacgaca 
1861 gcagccagtg gcaagctgcg actgaagaag cgattattga gctggccgcg cagccagaag 
1921 tggtggcgat tggtgaatgt ggtctcgact ttaaccgcaa cttttcgacg ccggaagagc 
1981 aggaacgcgc ttttgttgcc cagctacgca ttgccgcaga tttaaacatg ccggtattta 
2 041 tgcactgtcg cgatgcccac gagcggttta tgacattgct ggagccgtgg ctggataaac 
2101 tgcctggtgc ggttcttcat tgctttaccg gcacacgcga agagatgcag gcgtgcgtgg 
2161 cgcatggaat ttatatcggc attaccggtt gggtttgcga tgaacgacgc ggactggagc 
2221 tgcgggaact tttgccgttg attccggcgg aaaaattact gatcgaaact gatgcgccgt 
22 81 atctgctccc tcgcgatctc acgccaaagc catcatcccg gcgcaacgag ccagcccatc 
2 341 tgccccatat tttgcaacgt attgcgcact ggcgtggaga agatgccgca tggctggctg 
2401 ccaccacgga tgctaatgtc aaaacactgt ttgggattgc gttttagagt ttgcggaa 
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T 1: AE000459. Escherichia coli ...[gi:2367306] 



Links 



LOCUS AE000459 11057 bp DNA linear BCT 01-DEC-2000 

DEFINITION Escherichia coli K12 MG1655 section 349 of 400 of the complete 
genome . 

ACCESSION AE000459 U00096 
VERSION AE000459.1 GI:2367306 

KEYWORDS 

SOURCE Escherichia coli K12 

ORGANISM Escherichia coli K12 

Bacteria ; Proteobac t eria ; Gammaproteobac ter ia ; Enterobacteriales ; 
Enterobacteriaceae ; Escherichia . 
REFERENCE 1 (bases 1 to 11057) 

AUTHORS Blattner , F . R . , Plunkett # G. Ill, Bloch,C.A., Perna,N.T., Burland,V., 
Riley, M., Collado-Vides , J . , Glasner , J.D. , Rode,C.K., Mayhew,G.F., 
Gregor,J., Davis, N.W., Kirkpatrick,H.A. , Goeden,M.A., Rose, D. J., 
Mau, B . and Shao,Y. 
TITLE The complete genome sequence of Escherichia coli K-12 

JOURNAL Science 277 (5331), 1453-1474 (1997) 
MEDLINE 97426617 
PUBMED 9278503 
REFERENCE 2 (bases 1 to 11057) 
AUTHORS Blattner , F . R . 
TITLE Direct Submission 

JOURNAL Submitted (16- JAN-1997) Guy Plunkett III, Laboratory of Genetics,. 

University of Wisconsin, 445 Henry Mall, Madison, WI 53706, USA. 

Email: ecoli@genetics.wisc.edu Phone: 608-262-2534 Fax: 

608-263-7459 
REFERENCE 3 (bases 1 to 11057) 
AUTHORS Blattner , F . R . 
TITLE Direct Submission 

JOURNAL Submitted ( 02 -SEP- 1997) Guy Plunkett III, Laboratory of Genetics, 
University of Wisconsin, 445 Henry Mall, Madison, WI 53706, USA. 
Email: ecoli@genetics.wisc.edu Phone: 608-262-2534 Fax: 
608-263-7459 
REFERENCE 4 (bases 1 to 11057) 

AUTHORS Plunkett , G . Ill . 

TITLE Direct Submission 

JOURNAL Submitted ( 13 -OCT- 1998) Laboratory of Genetics, University of 
Wisconsin, 445 Henry Mall, Madison, WI 53706, USA 
COMMENT On Sep 9, 1997 this sequence version replaced gi : 1790264 . 

This sequence was determined by the E. coli Genome Project at the 
University of Wisconsin-Madison (Frederick R. Blattner, director) . 
Supported by NIH grants HG00301 and HG01428 (from the Human Genome 
Project and NCHGR) . The entire sequence was independently 
determined from E. coli K12 strain MG1655 . Predicted open reading 
frames were determined using GeneMark software, kindly supplied by 
Mark Borodovsky, Georgia Institute of Technology, Atlanta-, GA, 
. 30332 [e-mail: mark@amber.gatech.edu]. Open reading frames that 
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have been correlated with genetic loci are being annotated with CG 
Site Nos., unique ID nos . for the genes in the E. coli Genetic 
Stock Center (CGSC) database at Yale University, kindly supplied by 
Mary Berlyn. A public version of the database is accessible 
( http : //cgsc .biology .vale . edu ) . Annotation of the genome is an 
ongoing task whose goal is to make the genome sequence more useful 
by correlating it with other data. Comments to the authors are 
appreciated. Updated information will be available at the E. coli 
Genome Project's World Wide Web site 

( http : //www. genetics .wise .edu ) . *** The E. coli K12 sequence and 
its annotations are periodically updated; this is version M54 . No 
sequence changes. Annotation updates: updated gene identifications 
and products; all new functional assignments courtesy of Monica 
Riley; added promoters, protein binding sites, and repeated 
sequences described in reference 1. The unique numeric identifiers 
beginning with a lowercase ' b' assigned to each gene (protein- or 
RNA-encoding) are now designated as gene synonyms instead of 
labels. This should allow them to be searched for in Entrez as gene 



names 



FEATURES 

source 



promoter 



p romoter 



gene 



CDS 



promoter 



gene 



CDS 



Location/Qualifiers 
1. . 11057 

/organism= "Escherichia coli K12" 
/mol_type="genomic DNA" 
/strain="K12" 
/sub_strain="MG1655" 
/ db_xr e f="t axon : 8 3 3 3 3 " 
complement (8 . .36) 

/note=" factor Sigma70; predicted +1 start at 4013851" 
27. .56 

/note="f actor Sigma70; predicted +1 start at 4013913" 
168 . . 929 
/gene="udp " 

/no te= " synonym : b3 831" 
168 . . 929 
/gene="udp " 
/EC_number= " 2.4.2.3 " 

/function="enzyme; Salvage of nucleosides and nucleotides" 

/note="o253; 100 pet identical to UDP_EC0LI SW: P12758" 

/codon_start=l 

/ 1 r ans 1_ t ab 1 e = 11 

/product = "uridine phosphorylase " 

/protein id=" AAC76834 . 1 " 

/db_xref ="GI : 1790265" 

/ 1 r ans 1 a t i on= " MS KSDVFHLGLTKNDLQGATLAI VPGDPDRVEKI AALMDKPVKL 

ASHREFTTWRAELDGKPVIVCSTGIGGPSTSIAVEELAQLGIRTFLRIGTTGAIQPHI 

NVGDVLVTTASVRLDGASLHFAPLEFPAVADFECTTALVEAAKSIGATTHVGVTASSD 

TFYPGQERYDTYSGRWRHFKGSMEEWQAMGVMNYEMESATLLTMCASQGLRAGMVAG 

VI VNRTQQE I PNAETMKQTESHAVKI WEAARRLL " 

1009. . 1037 

/note=" factor Sigma70; predicted +1 start at 4014894" 

1070. .2497 

/gene= "yigN" 

/note=" synonym: b3 83 2" 

1070 . .2497 

/gene="yigN" 

/function="phenotype; Not classified" 

/note="o475; 99 pet identical amino acid sequence and 
equal length to YIGN_ECOLI SW : P27850" 
/codon_start=l 
/transl table=ll 
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g ene 



gene 



CDS 



promoter 



gene 



/product= "putative alpha helix chain" 
/protein_id= " AAC76835.1 " 
/db_xref = "GI : 1790266" 

/trans la t ion = "OTFSIMWAVIALVGVAIGWLFASYQHAQQKAEQLAEREEMVAE 

LSAAKQQITQSEHWRAECELLNNEVRSLQSINTSLEADLREVTTRMEAAQQHADDKIR 

QMINSEQRLSEQFENLANRIFEHSNRRVDEQNRQSLNSLLSPLREQLDGFRRQVQDSF 

GKEAQERHTLTHEIRNLQQLNAQMAQEAINLTRALKGDNKTQGNWGEVVLTRVLEASG 

LREGYEYETQVSIENDARSRMQPDVIVRLPQGKDWIDAKMTLVAYERYFNAEDDYTR 

ESALQEHIASVRNHIRLLGRKDYQQLPGLRTLDYVLMFIPVEPAFLLALDRQPELITE 

ALKNNIMLVSPTTLLVALRTIANLWRYEHQSRNAQQIADRASKLYDKMRLFIDDMSAI 

GQSLDKAQDNYRQAMKKLS SGRGNVLAQAEAFRGLGVE I KRE INPDLAEQAVSQDEEY 

RLRS VPEQPNDEAYQRDDE YNQQSR " 

2451. .2482 

/gene= n yigN" 

/note="f actor Sigma70; predicted +1 start at 4016339" 

2592 . .3347 

/gene="ubiE" 

/note=" synonym: b3 833" 



CDS 



/gene="ubiE" 

/function= "enzyme ; Biosynthesis of cof actors, carriers: 
Menaquinone, ubiquinone" 

/note="o251; 100 pet identical to YIGO_ECOLI SW: P27851" 

/codon_start=l 

/ 1 r ans l_t ab 1 e = 11 

/product = " 2 -oc taprenyl - 6 -me thoxy- 1 , 4 -benzoquinone - - > 
2 -oc tapreny 1 - 3 -methyl - 6 -me thoxy- 1 , 4 -benzoquinone " 
/protein id= M AAC768 3 6 . 1 " 
/db_xref="GI : 2367307" 

/ trans lation= "MVDKSQETTHFGFQTVAKEQKADMVAHVFHSVASKYDVMNDLMS 

FGIHRLWKRFTIDCSGVRRGQTVLDLAGGTGDLTAKFSRLVGETGKWLADINESMPK 

MGREKLRNIGVIGNVEYVQANAEALPFPDNTFDCITISFGLRNVTDKDKALRSMYRVL 

KPGGRLLVLEFSKPIIEPLSKAYDAYSFHVLPRIGSLVANDADSYRYLAESIRMHPDQ 

DTLKAMMQDAGFESVDYYNLTAGWALHRGYKF " 

3361. .3966 

/gene="yigP" 

/no t e = " synonym : b3 8 3 4 " 

3361. .3966 

/gene= M yigP" 

/function="orf ; Unknown" 

/note="o201; sequence changes cause internal changes to 
thesequence relative to earlier versions (YIGP_ECOLI SW: 
P27852) but the N- andC- terminal sequences are unaltered" 
/ codon_start=l 
/trans l_table=ll 

/product="orf , hypothetical protein" 
/protein id= " AAC76837 . 1 " 
/db_xref="GI:2367308" 

/trans la tion= "MPFKPLVTAGIESLLNTFLYRSPALKTARSRLLGKVLRVEVKGF 

STSLILVFSERQVDVLGEWAGDADCTVIAYASVLPKLRDRQQLTALIRSGELEVQGDI 

QWQNFVALADLAE FDPAELLAP YTGD I AAEG I S KAMRGGAKFLHHG I KRQQR YVAEA 

ITEEWRMAPGPLEVAWFAEETAAVERAVDALTKRLEKLEAK" 

3858. .3875 

/gene="yigP" 

/note="f actor Sigma54; predicted +1 start at 4017732" 

3963. .5603 

/gene= n yigR" 

/note= " synonym : b3 8 3 5 " 

3963 . .5603 

/gene= n yigR" 
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/f unction= "orf ; Unknown" 

/note="o546; sequence change joins ORFs yigQ, yigR, and 
yigS from earlier version (YIGQ_ECOLI SW: P27853; 
YIGR_ECOLI SW: P27854; YIGS_ECOLISW : P27855) ; very good 
match to Azotobacter chroococcum partial ORF1 GB : U48404" 
/codon_start=l 
/ trans l_table= 11 

/product="orf , hypothetical protein" 
/protein id=" AAC76838 . 1 " 
/db_xref="GI: 2367309" 

/ translation="MTPGEVRRLYFIIRTFLSYGLDELIPKMRITLPLRLWRYSLFWM 
PNRHKDKLLGERLRLALQELGPWIKFGQMLSTRRDLFPPHIADQLALLQDKVAPFDG 
KLAKQQIEAAMGGLPVEAWFDDFEIKPLASASIAQVHTARLKSNGKEWIKVIRPDIL 
PVIKADLKLIYRLARWVPRLLPDGRRLRPTEWREYEKTLIDELNLLRESANAIQLRR 
NFEDSPMLYIPEVYPDYCSEGMMViyiERIYGIPVSDVAALEKNGTNMKLLAERGVQVFF 
TQVFRDS FFHADMHPGNI FVS YEHPENPKYI G IDCGI VGSLNKEDKRYLAENF I AFFN 
RDYRKVAELHVDSGWVPPDTNVEEFEFAIRTVCEPIFEKPLAEISFGHVLLNLFNTAR 
RFNME VQPQLVLLQKTLL YVEGVGRQLYPQLDLWKTAKPFLES WI KDQVG I PALVRAF 
KE KAP F-WVE KMPELPEL VYD SLRQG'KY-LQKS VDK I ARELQ S'NKVRQGQSRYFLGI GAT 
LVLSGTFLLVSRPEWGLMPGWLMAGGLIAWFVGWRKTR" 

gene 5640. .5951 

/gene= n b3836" 

CDS 5640.. 5951 

/gene= n b3836" 

/f unction="orf ; Unknown" 

/note= n ol03 ; sequence change split ORF of earlier 
version;was part of yigT (o261; YIGT_ECOLI SW : P27856) " 
/ codon_start=l 
/transl tableau 

/product="orf , hypothetical protein" 
/protein_id= " AAC76839 . 1 " 
/db_xref="GI :2367310" 

/ 1 ranslat ion= "MRLCLII I YHRGTCMGGISIWQLLI IAVI WLLFGTKKLGS IGS 
DLGAS I KGFKKAMSDDEPKQDKTSQDADFTAKTI ADKQADTNQEQAKTEDAKRHDKEQ 
V" 

gene 5815. .6156 

/gene= M b3 83 7" 
CDS 5815 . .6156 

/gene="b3837" 

/f unction= "putative factor; Not classified" 

/note="oll3 ; sequence change split ORF of earlier 

version;was part of yigT (o261; YIGT_ECOLI SW: P27856) » 

/ codon_start=l 

/transl_table=ll 

/product = "putative histone" 

/protein_id= " AAC76840.1 " 

/db_xref= n GI:2367311" 

/ translation= "MMNQSRIKPVRMLILLRKLSPISRRIRIRNRLKQKTRSATIKSR 

CNPCLISVLANCYWCSSSASPPSGAATTACGGKNGSGLDSRVAFTGDNGAERTDPGVK 

TPGVSGQSEKG" 
gene 6035. .6472 

/gene="b383 8" 
CDS 6035 . . 6472 

/gene="b3 83 8" 

/f unction =" orf ; Unknown" 

/note="ol45; sequence change split ORF of earlier 
version;was part of yigT (o261; YIGT_ECOLI SW: P27856) « 
/codon_start=l 
/transl table=ll 

/product="orf , hypothetical protein" 
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/protein id=" AAC7 6841 . 1 " 
/db_xref= n GI: 2 367312" 

/ translations "MAVKTVAGWIRALRSLATTVQNELTQELKLQEFQDSLKKVEKAS 
LTNLTPELKASMDELRQAAESMKRSYVANDPEKASDEAHTIHNPVVKDNEAAHEGVTP 
AAAQTQASSPEQKPETTPEPWKPAADAEPKTAAPSPSSSDKP" 

gene 6475.. 7251 

/gene= "yigU" 

/note=" synonym: b3839" 

CDS 6475.. 7251 

/gene="yigU" 

/f unction = "or f ; Unknown" 

/note="o258; sequence change joins two ORFs relative 
toearlier version; 97.7 pet identical to the conceptual 
ORF YIGU_ECOLI SW : P27857" 
/ codon_start=l 
/trans l_table= 11 

/product="orf , hypothetical protein" 
/protein id=" AAC76842 . 1 " 
/ db_xr-e f = " G-I : 2 3 6 7-3 1 3" 

/ 1 rans la t ion= " MSVEDTQPLITHLI ELRKRLLNC I IAVI VI FLCLVYFAND I YHL 
VSAPLIKQLPQGSTMIATDVASPFFTPIKLTFMVSLILSAPVILYQVWAFIAPALYKH 
ERRLWPLLVSSSLLFYIGMAFAYFWFPLAFGFLANTAPEGVQVSTDIASYLSFVMA 
LFMAFGVSFEVPVAIVLLCWMGITSPEDLRKKRPYVLVGAFWGMLLTPPDVFSQTLL 
AI PM YCL FE I GVF F S RF YVGKGRNREEEND AEAE S E KTEE " 

gene 7281. .7901 

. /gene="yigW_l" 

/note=" synonym: b3 84 0" 

CDS 7281. .7901 

/gene="yigW_l" 

/f unction="orf ; Unknown" 

/note="o206; 98 pet identical to N-terminal 184 residues 
of a 264 aa conceptual translation YIGW_ECOLI SW : P27859" 
/ codon_s t ar t = 1 
/transl table=ll 

/product= "orf, hypothetical protein" 
/protein id=" AAC76843 . 1 " 
/db_xref = »GI : 1790274 " 

/translation="ME YRM FD I GVNLTS S QFAKDRDD WACAFD AG VNGLL I TGTNLR 
ESQQAQKLARQYSSCWSTAGVHPHDSSQWQAATEEAIIELAAQPEWAIGECGLDFNR 
NFSTPEEQERAFVAQLRIAADLNMPVFMHCRDAHERFMTLLEPWLDKLPGAVLHCFTG 
TREEMQAC VAHG I Y I G I TGWVCDEPTRTGAAGTFAVDSGGKI TDRN " 

gene 7735.. 8076 

/gene="yigW_2 " 

/note= " synonym : b3 84 1 " 

CDS 7735.. 8076 

/gene="yigW_2" 
/function="orf ; Unknown" 

/note="oll3; 100 pet identical to 80 residues of a 2$.4 aa 
conceptual translation YIGW_ECOLI SW: P27859" 
/ codon_start=l 
/transl tableau 

/product="orf , hypothetical protein" 
/protein id=" AAC76844 . 1 " 
/db_xref ="GI : 1790275 " 

/ trans lation= "MRFFIALPAHAKRCRRAWRMEFISALPVGFAMNRRGLELRELLP 
LIPAEKLLIETDAPYLLPRDLTPKPSSRRNEPAHLPHILQRIAHWRGEDAAWLAATTD 
ANVKTLFGIAF" 
gene complement (8073 . . 8561) 

/gene="rfaH" 
/note= " synonym : b3 84 2 " 
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CDS 



promoter 



gene 



CDS 



gene 



CDS 



complement (8073 . .8561) 
/gene="rfaH" 

/ f unc t ion= " regulator ,• Macromolecule me t aboli sm : 
Lipopolysaccharide " 

/note="fl62; 100 pet identical to RFAH_ECOLI SW: P26614; 
alternate gene names sfrB, hlyT" 
/codon_start=l 
/transl table=ll 

/product="transcriptional activator affecting biosynthesis 
of lipopolysaccharide core, F pilin, and haemolysin" 
/protein id= " AAC7 6 84 5 . 1 " 
/db_xref ="GI : 1790276 " 

/ 1 r ans la t ion= " MQSWYLLYCKRGQLQRAQEHLERQAVNCLAPMITLEKI VRGKRT 
AVSEPLFPNYLFVEFDPEVIHTTTINATRGVSHFVRFGAS PAIVPSAVIHQLSVYKPK 
DIVDPATPYPGDKVIITEGAFEGFQAIFTEPDGEARSMLLLNLINKEIKHSVKNTEFR 
KL" 

complement (85 91 . .8619) 

/note="f actor Sigma70; predicted +1 start at 4022434" 

O *T r "7 OTA ^ 

U KJ I ^ . . O / \J-±. 

/note="f actor Sigma70; predicted +1 start- at 4022561" 

8728. .10221 

/gene= "yigC" 

/note=" synonym: b3 84 3" 

8728. .10221 

/gene="yigC" 

/function="putative enzyme; Not classified" 
/note="o497 ; ??? pet identical to conceptual 
translationYIGC_ECOLI SW: P26615" 
/codon_start=l 
/transl _table=ll 

/product= "putative oxidoreductase " 
/protein id= " AAC76846 . 1 " 
/db_xref="GI : 1790277" 

/ trans lation= "MDAMKYNDLRDFLTLLEQQGELKRITLPVDPHLEITEIADRTLR 

AGGPALLFENPKGYSMPVLCNLFGTPKRVAMGMGQEDVSALREVGKLLAFLKEPEPPK 

GFRDLFDKLPQFKQVLNMPTKRLRGAPCQQKIVSGDDVDLNRIPIMTCWPEDAAPLIT 

WGLTWRGPHKERQNLGIYRQQLIGKKTKLIMRWLSHRGGALDYQEWCAAHPGERFPVS 

VALGAD PAT I LGAVTP VPDTLS E YAFAGLLRGTKTE WKC I S NDLE VP AS AE IVLEGY 

IEQGETAPEGPYGDHTGYYNEVDSFPVFTVTHITQREDAIYHSTYTGRPPDEPAVLGV 

ALNEVFVP ILQKQFPE I VDF YLPPEGCS YRLAWTI KKQ YAGHAJOIVMMGVWS FLRQF 

MYTKFVIVCDDDWARDWNDVIWAITTRMDPARDTVLVENTPIDYLDFASPVSGLGSK 

MGLDATNKWPGETQREWGRPIKKDPDWAHIDAIWDELAIFNNGKSA" 

10267. .10968 

/gene="ubiB" 

/note=" synonym: b3844" 

10267 . . 10968 

/gene="ubiB" 

/ EC_number = " 1.6.8. - " 

/function= "enzyme; Energy metabolism, carbon: Electron 
transport" 

/note="o233; 99 pet identical amino acid sequence and 
equal length to UBIB_EC0LI SW: P23486; similar to 
monooxygenase subunits f romMethylococcus capsulatus and 
Pseudomonas putida" 
/codon_start=l 
/transl table=ll 

/product="f errisiderophore reductase; flavin reductase 
(NADPH: flavin oxidoreductase)" 
/protein id=" AAC76847 . 1 " 
/db xref="GI:2367314" 
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/ trans 1 at ion= " MTTLSCKVTSVEAITDTVYRVRIVPDAAFSFRAGQYLMVVMDER 
DKRPFSMASTPDEKGFIELHIGASEINLYAKAVMDRILKDHQIWDIPHGEAWLRDDE 
ERPMILIAGGTGFSYARSILLTALARNPNRDITIYWGGREEQHLYDLCELEALSLKHP 
GLQWPVVEQPEAGWRGRTGTVLTAVLQDHGTLAEHDIYIAGRFEMAKIARDLFCSER 
NAREDRLFGDAFAFI " 

BASE COUNT 2621 a 2682 c 3073 g 2681 t 

ORIGIN 

1 agtcctgcag aatgaagggt gatttatgtg atttgcatca cttttggtgg gtaaatttat 
61 gcaacgcatt tgcgtcatgg tgatgagtat cacgaaaaaa tgttaaaccc ttcggtaaag 
121 tgtctttttg cttcttctga ctaaaccgat tcacagagga gttgtatatg tccaagtctg 
181 atgtttttca tctcggcctc actaaaaacg atttacaagg ggctacgctt gccatcgtcc 
241 ctggcgaccc ggatcgtgtg gaaaagatcg ccgcgctgat ggataagccg gttaagctgg 
301 catctcaccg cgaattcact acctggcgtg cagagctgga tggtaaacct gttatcgtct 
361 gctctaccgg tatcggcggc ccgtctacct ctattgctgt tgaagagctg gcacagctgg 
421 gcattcgcac cttcctgcgt atcggtacaa cgggcgctat tcagccgcat attaatgtgg 
481 gtgatgtcct ggttaccacg gcgtctgtcc gtctggatgg cgcgagcctg cacttcgcac 
541 cgctggaatt cccggctgtc gctgatttcg aatgtacgac tgcgctggtt gaagctgcga 
601 aatccattgg cgcgacaact cacgttggcg tgacagcttc ttctgatacc ttctacccag 
661 gtcaggaacg ttacgataet tactctggtc gcgtagttcg tcactttaaa ggttctatgg 
721 aagagtggca ggcgatgggc gtaatgaact atgaaatgga atctgcaacc ctgctgacca 
781 tgtgtgcaag tcagggcctg cgtgccggta tggtagcggg tgttatcgtt aaccgcaccc 
841 agcaagagat cccgaatgct gagacgatga aacaaaccga aagccatgcg gtgaaaatcg 
901 tggtggaagc ggcgcgtcgt ctgctgtaat tctcttctcc tgtctgaagg ccgacgcgtt 
961 cggccttttg tatttttgcg tagcgcctcg caggaaatgc ctttccaact ggacgtttgt 
1021 acagcacaat tctattttgt gcgggtaagt tgttgcgtca ggaggcgttg tggatttctc 
1081 aatcatggtt tacgcagtta ttgcgttggt gggtgtggca attggctggc tgtttgccag 
1141 ttatcaacat gcgcagcaaa aagccgagca attagctgaa cgtgaagaga tggtcgcgga 
12 01 gttaagcgcg gcaaaacaac aaattaccca aagcgagcac tggcgtgcag agtgcgagtt 
12 61 actcaataac gaagtgcgca gcctgcaaag tattaacacc tctctggagg ccgatctgcg 
1321 tgaagtaacc acgcggatgg aagccgcaca gcaacatgct gacgataaaa ttcgccagat 
1381 gattaacagc gagcagcgcc tcagtgagca gtttgaaaac ctcgccaacc gtatttttga 
1441 gcacagcaat cgccgggttg atgagcaaaa ccgtcagagt ctgaacagcc tgttgtcgcc 
1501 gctacgtgaa caactggacg gtttccgccg tcaggttcag gacagcttcg gtaaagaagc 
1561 acaagaacgc cataccctga cccacgaaat tcgcaatctc cagcaactca acgcgcaaat 
1621 ggcccaggaa gcgatcaacc tgacgcgcgc gctgaaaggc gacaataaaa cccagggcaa 
16 81 ctggggcgag gtagtattga cgcgggtgct ggaggcttcc ggtctgcgtg aagggtatga 
1741 atatgaaacc caggtcagca tcgaaaatga cgcccgctcg cggatgcagc cggatgtcat 
1801 cgtgcgcctg ccgcagggaa aagatgtggt gatcgacgcc aaaatgacgc tggtcgccta 
1861 tgaacgctat tttaacgccg aagacgacta cacccgcgaa agcgcgctac aggaacatat 
1921 cgcgtcggtg cgtaaccata tccgtttgct gggacgcaaa gattatcaac agctgccggg 
1981 gctgcgaact ctggattacg tgctgatgtt tattcccgtt gaacccgctt ttttactggc 
2041 gcttgaccgc cagccggagc tgatcaccga agcgttgaaa aacaacatca tgctggttag 
2101 cccgactacg ctgctggtgg cgctgcgcac tatcgccaac ctgtggcgtt atgagcatca 
2161 aagccgcaac gcccagcaaa tcgccgatcg tgccagcaag ctgtacgaca agatgcgttt 
2221 gttcatcgat gacatgtccg cgattggtca aagtctcgac aaagcgcagg ataattatcg 
2281 gcaggcaatg aaaaaactct cttcagggcg cggaaatgtg ctggcgcagg cagaagcgtt 
2341 tcgcggttta ggagtagaaa ttaaacgcga gattaatccg gatttggctg aacaggcggt 
2401 gagccaggat gaagagtatc gacttcggtc ggttccggag cagccgaatg atgaagctta 
2461 tcaacgcgat gatgaatata atcagcagtc gcgctagccc attgggagta gttaagccgg 
2521 gtagaaatct a'gggcatcga cgcccaatct gttacacttc tggaacaatt ttttgatgag 
2581 caggcattga gatggtggat aagtcacaag aaacgacgca ctttggtttt cagaccgtcg 
2641 cgaaggaaca aaaagcggat atggtcgccc acgttttcca ttccgtggca tcaaaatacg 
2701 atgtcatgaa tgatttgatg tcatttggta ttcatcgttt gtggaagcga ttcacgattg 
2761 attgcagcgg cgtacgccgt gggcagaccg tgctggatct ggctggtggc accggcgacc 
2821 tgacagcgaa attctcccgc ctggtcggag aaactggcaa agtggtcctt gctgatatca 
2881 atgaatccat gcccaaaatg ggccgcgaga agctgcgtaa tatcggtgtg attggcaacg 
2941 ttgagtatgt tcaggcgaac gctgaggcgc tgccgttccc ggataacacc tttgattgca 
3001 tcaccatttc gtttggtctg cgtaacgtca ccgacaaaga taaagcactg cgttcaatgt 
3061 atcgcgtgct gaaacccggc ggccgcctgc tggtgcttga gttctcgaag ccaattatcg 
3121 agccgctgag caaagcctat gatgcatact ccttccatgt gctgccgcgt attggctcac 
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3181 
3241 
3301 
3361 
3421 
3481 
3541 
3601 
3661 
3721 
3781 
3841 
3901 
3961 
4021 
4081 
4141 
4201 
42 61 
4321 
4381 
4441 
4501 
4561 
4621 
4681 
4741 
4801 
4861 
4921 
4981 
5041 
5101 
5161 
5221 
5281 
5341 
5401 
5461 
5521 
5581 
5641 
5701 
5761 
5821 
5881 
5941 
6001 
6061 
6121 
6181 
6241 
6301 
6361 
6421 
6481 
6541 
6601 
6661 
6721 



tggtcgcgaa 
atcaggatac 
atctgacggc 
atgcctttta 
cgctcacccg 
gtaaaaggct 
ggcgaatggg 
cttcgcgatc 
gatattcagg 
gaactgctgg 
ggaggcgcaa 
attactgaag 
acggctgccg 
aaatgacgcc 
gacttgatga 
cattattctg 
ccctgcaaga 
atctttttcc 
ttgatggcaa 
cgtggtttga 
ccgcgcgatt 
tgccggttat 
tgctgccgga 
tgattgatga 
ttgaagacag 
tgatggtgat 
acggcactaa 
ttcgcgacag 
acccggaaaa 
aagataaacg 
tggcagagct 
aatttgccat 
ttggacatgt 
cgcaactggt 
atccgcaact 
aggtcggtat 
aaatgccaga 
acagtgttga 
gttattttct 
gacctgaatg 
tcggttggcg 
tgcggctttg 
ggcagttatt 
ccatcggttc 
aaccaaagca 
agcaggcgga 
agcaggtgta 
gcctcgccgc 
gattcgcgcg 
actccaggag 
gcccgaactg 
ctacgttgca 
ggtgaaagat 
cagttcgccg 
cgctgaaccg 
gtagaagata 
tgcattatcg 
cacctggtat 
gacgtggcct 
tcagcgccgg 



cgacgccgac 
cctgaaagcc 
aggggttgtg 
aacctttagt 
cgctgaaaac 
tttcgacgtc 
caggcgatgc 
gccagcagct 
tggtgcaaaa 
ccccttatac 
agttcctgca 
agtggcgtat 
tcgagcgtgc 
aggtgaagta 
actgatcccc 
gatgccaaat 
actggggccg 
accgcatatt 
gctggcgaag 
cgattttgaa 
gaaatcgaat 
taaagcggat 
tggtcgccgt 
actgaatttg 
cccgatgctc 
ggagcgcatt 
catgaaattg 
ctttttccat 
cccgaaatat 
ctatctggca 
acacgtcgat 
tcgtacggtc 
actgttaaat 
gttactccag 
cgatttatgg 
tcctgcgctg 
actgcctgaa 
taagattgcc 
cggaattggc 
ggggctgatg 
caaaacacgc 
tttaatcatc 
gattattgcc 
cgatcttggt 
ggataaaacc 
tacgaatcag 
atccgtgttt 
cttctggggc 
ttgcgttcac 
tttcaggaca 
aaagcgtcga 
aacgatcctg 
aatgaagctg 
gaacagaagc 
aaaaccgctg 
ctcaaccgct 
cggtgatcgt 
ccgcgccatt 
cgccgttctt 
tgattctcta 



agctaccgtt 
atgatgcagg 
gcgctgcatc 
gacggcagga 
ggcccgctcg 
attgattctg 
tgactgcacc 
taccgcactg 
cttcgttgcg 
cggtgatatc 
tcacggcatt 
ggcacccggt 
tgttgatgcc 
cggcgcctat 
aaaatgcgta 
cggcataaag 
gtttggatca 
gccgatcagc 
cagcagattg 
atcaagccgc 
ggtaaagagg 
ctgaaactta 
ctgcgcccaa 
ctgcgggaat 
tacatcccgg 
tacggcattc 
ctggcggaac 
gccgatatgc 
atcggcattg 
gaaaacttta 
tctggctggg 
tgtgaaccta 
ctgtttaata 
aaaaccctgc 
aaaacggcga 
gtgagagcat 
ttggtttacg 
cgcgagcttc 
gctacgttag 
cccggctggt 
tgattttttc 
atctaccaca 
gtcatcgttg 
gcgtcgatca 
agtcaggatg 
gaacaggcta 
gatatcggtt 
cgcaacgact 
tggcgacaac 
gtctgaaaaa 
tggatgaact 
aaaaggcgag 
cgcatgaggg 
cagaaaccac 
caccttcccc 
tatcacgcat 
gatattcctg 
gatcaagcag 
tacgccgatc 
tcaggtgtgg 



atctggcaga 
atgccggatt 
gtggttataa 
attgaaagtc 
cgtctgctgg 
gtgttcagcg 
gttatcgcct 
attcgcagtg 
ctggcagatc 
gccgctgaag 
aagcgccagc 
ccgcttgaag 
ctgaccaaac 
atttcatcat 
tcaccctgcc 
acaaactttt 
agttcgggca 
tggcgttatt 
aagctgcaat 
tggcttctgc 
tggtgattaa 
tctaccgtct 
ccgaagtggt 
ctgccaacgc 
aagtttaccc 
cggtgtctga 
gcggcgtgca 
accctggcaa 
attgcgggat 
tcgccttctt 
tgccaccaga 
tctttgagaa 
cggcgcgtcg 
tctacgtcga 
agcctttcct 
ttaaagaaaa 
acagtttgcg 
agtcaaatca 
tattaagtgg 
taatggcagg 
atcgctcaag 
gaggaacatg 
tactgctttt 
aaggctttaa 
ctgattttac 
aaacagaaga 
ttagcgaact 
gcctgtggcg 
ggtgcagaac 
ggttgaaaag 
acgccaggcc 
cgatgaagcg 
cgtaacgcct 
gccagagccg 
ttcgtcgagt 
ctgattgagc 
tgtctggtct 
ttgccgcaag 
aagctgacct 
gcatttatcg 



atccatccgt 
cgaaagtgtc 
gttctgacag 
tgctcaacac 
gtaaagtatt 
aacgccaggt 
acgccagtgt 
gtgagctgga 
tggcagagtt 
gaatcagcaa 
aacgttatgt 
tggcctggtt 
ggctggaaaa 
tcgcactttt 
gctacggcta 
aggtgagcga 
aatgttatca 
gcaggacaaa 
gggcggcttg 
ttctatcgcc 
agtcatccgc 
ggctcgctgg 
gcgcgagtac 
cattcagctt 
tgactattgt 
tgttgcggcg 
ggtgttcttc 
catcttcgta 
tgttggctcg 
taatcgcgac 
taccaacgtt 
accgctggcc 
cttcaatatg 
a 9999tagga 
ggagtcgtgg 
agcgccgttc 
ccagggcaag 
tgtacgtcag 
cacattcttg 
tggtctgatc 
gcgggccgtg 
tatgggtggt 
tggcaccaaa 
aaaagcaatg 
tgcgaaaact 
cgcgaagcgc 
gctattggtg 
gtaaaaacgg 
gaactgaccc 
gcgagcctca 
gcggagtcga 
cacaccatcc 
gccgctgcac 
gtggtaaaac 
gataaaccgt 
tgcgtaagcg 
atttcgccaa 
gttcaacgat 
ttatggtgtc 
ccccagcgct 



atgcatcccg 
gactactaca 
gagaccggaa 
cttcctgtat 
gcgcgtggag 
tgatgtactg 
gttgccgaaa 
agtgcagggc 
cgaccctgcg 
agccatgcgc 
ggcggaagcc 
tgcggaagag 
actggaggct 
ttaagctacg 
tggcgatact 
ctacgactgg 
acccgccgcg 
gttgctccgt 
ccggtagaag 
caggttcata 
ccggatattt 
gtgccgcgtt 
gaaaagacat 
cggcgcaatt 
a 9tgaaggga 
ctggagaaaa 
actcaggtct 
agctatgaac 
ctaaacaaag 
tatcgcaaag 
gaagagttcg 
gaaatttcgt 
gaagtgcagc 
cgccagcttt 
attaaagatc 
tgggtcgaaa 
tatttacagc 
ggacaatcgc 
ttggtcagcc 
gcctggtttg 
taacgtataa 
atcagtattt 
aagctcggct 
agcgatgatg 
atcgccgata 
cacgataaag 
ttcatcatcg 
tagcgggctg 
aggagttaaa 
ctaacctgac 
tgaagcgttc 
ataacccggt 
aaacgcaggc 
ctgctgcgga 
aaacatgtct 
tctgctgaac 
tgacatctat 
gatcgccacc 
gctgattctg 
gtataagcat 
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6781 gaacgtcgcc tggtggtgcc gctgctggtt 
6841 gcattcgcct actttgtggt ctttccgctg 
6901 gaaggggtgc aggtatccac cgacatcgcc 
6961 atggcgtttg gtgtctcctt tgaagtgccg 
7021 attacctcgc cagaagactt acgcaaaaaa 
7081 gtcgggatgt tgctgacgcc gccggatgtc 
7141 tactgtctgt ttgaaatcgg tgtcttcttc 
72 01 cgggaagagg aaaacgacgc tgaagcagaa 
7261 cccgtcaggg cggttgtcat atggagtaca 
7321 gttcgcaatt tgcgaaagac cgtgatgatg 
7381 atgggctact catcaccggc actaacctgc 
7441 gtcagtattc gtcctgttgg tcaacggcgg 
7501 aagctgcgac tgaagaagcg attattgagc 
7561 gtgaatgtgg tctcgacttt aaccgcaact 
7621 ttgttgccca gctacgcatt gccgcagatt 
7681 atgcccacga gcggtttatg acattgctgg 
7741 ttcttcattg ctttaccggc acacgcgaag 
7801 atatcggcat taccggttgg gtttgcgatg 
7 961 ttgecgttga ttccggcgga aaaattactg 
7921 cgcgatctca cgccaaagcc atcatcccgg 
7981 ttgcaacgta ttgcgcactg gcgtggagaa 
8041 gctaatgtca aaacactgtt tgggattgcg 
8101 acactgtgct taatctcttt attaataaga 
8161 tcgggttcgg tgaaaatggc ctgaaagcct 
8221 tcacccggat aaggggttgc cggatcgaca 
8281 tgaataaccg ccgatgggac tatcgctggc 
8341 ccgcgggtcg cgttgatagt cgtggtatga 
8401 tagttgggga acaatggctc actgactgca 
8461 gtgatcatcg gtgccaggca attcacagcc 
8521 agttgcccgc gcttgcagta cagtaaatac 
8581 atcggggcgc aaggatagca aaagctttac 
8641 tataccgtca gagttcacgc taatttaaca 
8701 ataatgggcg cagattaaga ggctacaatg 
8761 ttcttgacgc tgcttgaaca gcagggtgag 
8821 catctggaaa tcactgaaat tgctgaccgc 
8881 ttcgaaaacc ctaaaggcta ctcaatgccg 
8941 cgcgtggcga tgggcatggg gcaggaagat 
9001 ttggcgtttc tgaaagagcc ggagccgcca 
9061 ccgcagttta agcaagtatt gaacatgccg 
9121 caaaaaatcg tctctggcga tgacgtcgat 
9181 ccggaagatg ccgcgccgct gattacctgg 
9241 gagcggcaga atctgggcat ttatcgccag 
9301 cgctggctgt cgcatcgcgg cggcgcgctg 
9361 ggcgaacgtt tcccggtttc tgtggcgctg 
9421 gtcactcccg ttccggatac gctttcagag 
9481 aagaccgaag tggtgaagtg tatctccaat 
9541 gtgctggaag ggtatatcga acaaggcgaa 
9601 accggttact ataatgaagt cgatagtttc 
9661 cgtgaagatg cgatttacca ttccacctat 
9721 ctgggtgtcg cactgaacga agtgtttgtg 
9781 gtcgattttt acctgccgcc ggaaggctgc 
9841 aaacagtacg ccggacacgc gaagcgcgtc 
9901 tttatgtaca ctaaatttgt gatcgtttgc 
9961 gatgtgattt gggcgattac cacccgtatg 
10021 aatacgccta ttgattatct ggattttgcc 
10081 gggctggatg ccacgaataa atggccgggg 
10141 aaaaaagatc cagatgttgt cgcgcatatt 
10201 aacaacggta aaagcgcctg atgcgcgttt 
10261 aagcgcatga caaccttaag ctgtaaagtg 
10321 tatcgtgtcc gcatcgtgcc agacgcggcc 



tccagctctc tgctgtttta tatcggcatg 
gcatttggct tccttgccaa taccgcgccg 
agctatttaa gcttcgttat ggcgctgttt 
gtagcaattg tgctgctgtg ctggatgggg 
cgcccgtatg tgctggttgg tgcattcgtt 
ttctcgcaaa cgctgttggc gatcccgatg 
tcacgctttt acgttggtaa agggcgaaat 
agcgaaaaaa ctgaagaata aattcaaccg 
ggatgtttga tatcggcgtt aatttgacca 
ttgtagcgtg cgcttttgac gcgggagtta 
gtgaaagcca gcaggcgcaa aagctggcgc 
gcgtacatcc tcacgacagc agccagtggc 
tggccgcgca gccagaagtg gtggcgattg 
tttcgacgcc ggaagagcag gaacgcgctt 
taaacatgcc ggtatttatg cactgtcgcg 
agccgtggct ggataaactg cctggtgcgg 
agatgcaggc gtgcgtggcg catggaattt 
aaccgacgcg gactggagct gcgggaactt 
atcgaaactg atgcgccgta tctgctccct 
cgcaacgagc cagcccatct gccccatatt 
gatgccgcat ggctggctgc caccacggat 
ttttagagtt tgcggaactc ggtattcttc 
ttaagcaata gcatggagcg agcctcacca 
tcgaacgcgc cttcggtaat aatcacctta 
atgtctttcg gtttatatac cgatagctga 
gacgcgccaa agcgcacgaa gtggctgaca 
atcacttctg ggtcaaattc cacaaacagg 
gtacgttttc cacgcacgat tttttccagg 
tgtctttcga ggtgttcctg ggcacgttga 
caggattgca taatgactct tatccgttta 
gctaagttaa ttatattccc cggtttgcgt 
aatttacagc atcgcaaaga tgaacgccgt 
gacgccatga aatataacga tttacgcgac 
ctaaaacgta tcacgctccc ggtggatccg 
actttgcgtg ccggtgggcc tgcgctgttg 
gtgctgtgca acctgttcgg tacgccaaag 
gtttcggcgc tgcgtgaagt tggtaaatta 
aaaggtttcc gcgacctgtt tgataaactg 
acaaagcggc tgcgtggtgc gccctgccaa 
ctcaatcgca ttcccattat gacctgctgg 
gggctgacag tgacgcgcgg cccacataaa 
cagctgattg gtaaaaacaa actgattatg 
gattatcagg agtggtgtgc ggcgcatccg 
ggtgccgatc ccgccacgat tctcggtgca 
tatgcgtttg ccggattgct acgtggcacc 
gatcttgaag tgcccgccag tgcggagatt 
actgcgccgg aagggccgta tggcgaccac 
ccggtattta ccgtgacgca tattacccag 
accgggcgtc cgccagatga gcccgcggtg 
ccgattctgc aaaaacagtt cccggaaatt 
tcttatcgcc tggcggtagt gacaatcaaa 
atgatgggcg tctggtcgtt cttacgccag 
gatgatgacg ttaacgcacg cgactggaac 
gacccggcgc gggatactgt tctggtagaa 
tcgcctgtct ccgggctggg ttcaaaaatg 
gaaacccagc gtgaatgggg acgtcccatc 
gacgccatct gggatgaact ggctattttt 
gttttgccct atttatcgat ccgacagaga 
acctcggtag aagctatcac ggataccgta 
ttttcttttc gtgctggtca gtatttgatg 
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10381 gtagtgatgg atgagcgcga caaacgtccg ttctcaatgg cttcgacgcc ggatgaaaaa 

10441 gggtttatcg agctgcatat tggcgcttct gaaatcaacc tttacgcgaa agcagtcatg 

10501 gaccgcatcc tcaaagatca tcaaatcgtg gtcgacattc cccacggaga agcgtggctg 

10561 cgcgatgatg aagagcgtcc gatgattttg attgcgggcg gcaccgggtt ctcttatgcc 

10621 cgctcgattt tgctgacagc gttggcgcgt aacccaaacc gtgatatcac catttactgg 

10681 ggcgggcgtg aagagcagca tctgtatgat ctctgcgagc ttgaggcgct ttcgttgaag 

10741 catcctggtc tgcaagtggt gccggtggtt gaacaaccgg aagcgggctg gcgtgggcgt 

10801 actggcaccg tgttaacggc ggtattgcag gatcacggta cgctggcaga gcatgatatc 

10861 tatattgccg gacgttttga gatggcgaaa attgcccgcg atctgttttg cagtgagcgt 

10921 aatgcgcggg aagatcgcct gtttggcgat gcgtttgcat ttatctgaga tataaaaaaa 

10981 cccgcccctg acaggcggga agaacggcaa ctaaactgtt attcagtggc atttagatct 

11041 atgacgtatc tggcaaa 
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r 1: AE000167. Escherichia coli ...[gi: 1786836] 



Links 



LOCUS AE000167 10264 bp DNA linear BCT 01-DEC-2000 

DEFINITION Escherichia coli K12 MG1655 section 57 of 400 of the complete 
genome . 

ACCESSION AE000167 U00096 
VERSION AE000167.1 GI:1786836 

KEYWORDS 

SOURCE Escherichia coli K12 

ORGANISM Escherichia coli K12 

Bacteria; Proteobacteria; Gammaproteobacteria; Enterobacteriales ; 
Enterobacteriaceae ; Escherichia . 
REFERENCE 1 (bases 1 to 102 64) 

AUTHORS Blattner, F.R. , Plunkett,G. Ill, Bloch,C.A., Perna,N.T., Burland,V., 
Riley, M., Collado-Vides , J . , Glasner, J.D . , Rode,C.K., Mayhew,G.F., 
Gregory., Davis, N.W., Kirkpatrick, H . A. , Goeden,M.A., Rose, D. J., 
Mau, B . and Shao,Y. 
TITLE The complete genome sequence of Escherichia coli K-12 

JOURNAL Science 277 (5331), 1453-1474 (1997) 
MEDLINE 97426617 
PUBMED 9278503 
REFERENCE 2 (bases 1 to 102 64) 
AUTHORS Blattner , F . R . 
TITLE Direct Submission 

JOURNAL Submitted ( 16 - JAN-1997) Guy Plunkett III, Laboratory of Genetics, 
University of Wisconsin, 445 Henry Mall, Madison, WI 53706, USA. 
Email: ecoli@genetics.wisc.edu Phone: 608-262-2534 Fax: 
608-263-7459 
REFERENCE 3 (bases 1 to 10264) 

AUTHORS Blattner , F . R . 

TITLE Direct Submission 

JOURNAL Submitted ( 02 -SEP- 1997 ) Guy Plunkett III, Laboratory of Genetics, 
University of Wisconsin, 445 Henry Mall, Madison, WI 53706, USA. 
Email: ecoli@genetics.wisc.edu Phone: 608-262-2534 Fax: 
608-263-7459 
REFERENCE 4 (bases 1 to 10264) 

AUTHORS Plunkett, G. III. 

TITLE Direct Submission 

JOURNAL Submitted ( 13 -OCT- 1998 ) Laboratory of Genetics, University of 
Wisconsin, 445 Henry Mall, Madison, WI 53706, USA 
COMMENT This sequence was determined by the E. coli Genome Project at the 

University of Wisconsin-Madison (Frederick R. Blattner, director) . 
Supported by NIH grants HG00301 and HG01428 (from the Human Genome 
Project and NCHGR) . The entire sequence was independently 
determined from E. coli K12 strain MG1655 . Predicted open reading 
frames were determined using GeneMark software, kindly supplied by 
Mark Borodovsky, Georgia Institute of Technology, Atlanta, GA, 
30332 [e-mail: mark@amber.gatech.edu]. Open reading frames that 
have been correlated with genetic loci are being annotated with CG 
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Site Nos., unique ID nos . for the genes in the E. coli Genetic 
Stock Center (CGSC) database at Yale University, kindly supplied by 
Mary Berlyn. A public version of the database is accessible 
( http : //cgsc . biology . yale . edu ) . Annotation of the genome is an 
ongoing task whose goal is to make the genome sequence more useful 
by correlating it with other data. Comments to the authors are 
appreciated. Updated information will be available at the.E. coli 
Genome Project's World Wide Web site 

( http : //www . genetics . wise . edu ) . *** The E. coli K12 sequence and 
its annotations are periodically updated; this is version M54 . No 
sequence changes. Annotation updates: updated gene identifications 
and products; all new functional assignments courtesy of Monica 
Riley; added promoters, protein binding sites, and repeated 
sequences described in reference 1. The unique numeric identifiers 
beginning with a lowercase 'b f assigned to each gene (protein- or 
RNA-encoding) are now designated as gene synonyms instead of 
labels. This should allow them to be searched for in Entrez as gene 



names . 



source 



promoter 



gene 



CDS 



gene 



CDS 



Lou a t i on / Q u a .Liners 
1 . . 10264 

/organism= "Escherichia coli K12" 
/mol_type= "genomic DNA" 
/strain="K12" 
/sub_strain="MG1655" 
/db_xref="taxon: 833 33" 
64. .92 

/note="f actor Sigma70; predicted +1 start at 651381" 

176.. 1834 

/gene=" citA" 

/ no te=" synonym: b0619" 

176 . . 1834 

/gene="citA" 

/function= "putative regulator; Not classified" 
/note="o552 was o455; This 455 aa ORF is 29 pet identical 

(15 gaps) to 441 residues of an approx. 544 aa protein 
YJDH_ECOLI SW: P39272;This 77 aa ORF is 28 pet identical 

(2 gaps) to 59 residues of an approx. 4 00 aa protein 
YWFF_BACSU SW : P39642" 
/ codon_start=l 
/transl table=ll 

/product^ "putative sensor-type protein" 
/protein_id= " AAC73720 . 1 " 
/db_xref ="GI : 1786837" 

/ 1 rans 1 a t i on= " MLQLNENKQFAFFQRLAFPLRI FLL ILVFS I FVI AALAQ YFTAS 

FEDYLTLHVRDMAMNQAKIIASNUSVISAVKTRDYKRLATIANKLQRDTDFDYVVIGD 

RHSIRLYHPNPEKIGYPMQFTKQGALEKGESYFITGKGSMGMAMRAKTPIFDDDGKVI 

GWSIGYLVSKIDSWRAEFLLPMAGVFVVLLGILMLLSWFLAAHIRRQMMGMEPKQIA 

RVVRQQEALFSSVYEGLIAVDPHGYITAINRNARKMLGLSSPGRQWLGKPIVEWRPA 

DFFTEQIDEKRQDWANFNGLSVIANREAIRSGDDLLGAIISFRSKDEISTLNAQLTQ 

IKQYVESLRTLRHEHLNWMSTLNGLLQMKEYDRVLAMVQGESQAQQQLIDSLREAFAD 

RQVAGLLFGKVQRARELGLKMIIVPGSQLSQLPPGLDSTEFAAIVGNLLDNAFEASLR 

SDEGNKIVELFLSDEGDDWIEVADQGCGVPESLRDKIFEQGVSTRADEPGEHGIGLY 

L I AS YVTRCGGVI TLEDNDPCGTLFS I Y I PKVKPNDS S INP IDR " 

1803 . .2483 

/gene="citB" 

/note=" synonym: b0620" 

1803 . .2483 

/gene="citB" 

/function= "putative regulator; Not classified" 
/note="o226; 100 pet identical to Shigella flexneri 
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protein GB: SFU29654JL ACCESSION : U29654" 
/ codon_s t ar t = 1 
/trans l_t ab 1 e = 11 

/product=" sequence similarity to Shigella regulator" 
/protein id=" AAC73721 . 1 " 
/db_xref="GI : 1786838" 

/ translations "MTAPLTLLIVEDETPLAEMHAEYIRHIPGFSQILLAGNLAQARM 
MI ERFKPGL I LLDNYLPDGRGINLLHELVQAHYPGD WFTTAASDMETVSEAVRCGVF 
DYLIKPIAYERLGQTLTRFRQRKHMLESIDSASQKQIDEMFNAYARGEPKDELPTGID 
PLTLNAVRKLFKEPGVQHTAETVAQALTISRTTARRYLEYCASRHLIIAEIVHGKVGR 
PQRIYHSG" 

gene complement (2524 . . 3 909) 

/gene= n dcuC" 

/note= " synonym : b0621" 
CDS complement (2524 . .3909) 

/gene= "dcuC " 

/function= " transport ; Transport of small molecules: 
Carbohydrates, organic acids, alcohols" 

/note="f 461; This 461 aa GRF is 36 pet identical (6 gaps) 
to 441 residues of an approx. 456 aa protein YHCL_EC0LI 
SW: P45428" 
/codon_start=l 
/transl table=ll 

/products" transport of dicarboxylates " 
/protein id=" AAC73722 . 1 " 
/db_xref="GI: 1786839" 

/ trans 1 at i on= " MLTFIELL IGVWI VGVARYI I KGYS ATGVLFVGGLLLL 1 1 S AI 
MGHKVLPS SQASTGYS ATD I VE YVKI LLMSRGGDLGMM IMMLCGFAAYMTH IGANDMV 
VKLASKPLQYINSPYLLMIAAYFVACLMSLAVSSATGLGVLLMATLFPVMVNVGISRG 
AAAAICASPAAI ILAPTSGDWLAAQASEMSLIDFAFKTTLPIS IAAI IGMAIAHFFW 
QRYLDKKEHISHEMLDVSEITTTAPAFYAILPFTPIIGVLIFDGKWGPQLHIITILVI 
CMLIASILEFLRSFNTQKVFSGLEVAYRGMADAFANVVMLLVAAGVFAQGLSTIGFIQ 
SLISIATSFGSASIILMLVLVILTMLAAVTTGSGNAPFYAFVEMIPKLAHSSGINPAY 
LTI PMLQASMLGRTLS PVSGVWAVAGMAKI S PFEWKRTSVPVLVGLVI VIVATELM 
VPGTAAAVTGK " 

promoter complement (3939. .3967) 

/note="f actor Sigma70; predicted +1 start at 655214" 

protein bind complement (4051 . .4076) 

/note="central position to predicted promoter: -107" 
/bound_moiety="ArsR predicted site" 

promoter 4344.-4372 

/note= M f actor Sigma70; predicted +1 start at 655661" 

promoter 4443.. 4471 

/note="f actor Sigma70; predicted +1 start at 655760" 

g ene 4498 . . 5058 

/gene="crcA" 

/note= " synonym : bO 62 2 " 

CDS 4498 . . 5058 

/gene="crcA" 

/function= "orf ; Unknown" 

/note="ol86; 100 pet identical to GB : EC0GICA_1 ACCESSION: 

L29054; 100 pet. identical to 156 amino acids of 

YBEG_ECOLI SW: P37001 but contains 30 additional 

N- terminal residues" 

/codon_start=l 

/trans l_table= 11 

/product= "orf, hypothetical protein" 
/protein id=" AAC73723 . 1 " 
/db_xref="GI : 1786840" 

/translations "MNVSKYVAIFSFVFIQLISVGKVFANADEWMTTFRENIAQTWQQ 
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PEHYDLYIPAITWHARFAYDKEKTDRYNERPWGGGFGLSRWDEKGNWHGLYAMAFKDS 
V^KWEPIAGYGWESTWRPLADENFHLGLGFTAGVTARDNWNYIPLPVLLPLASVGYGP 
VTFQMTYIPGTYNWGNVYFAWMRFQF " 
5084 . .5112 

/note= M factor Sigma70; predicted +1 start at 656401" 

5233 . .5442 

/gene= "cspE" 

/note=" synonym: b0623 " 

5233 . .5442 

/gene="cspE M 

/f unction="phenotype ; Not classified" 

/note="o69; 100 pet identical to CSPE_ECOLI SW: P36997" 

/codon_start=l 

/ transl_table=ll 

/product= "cold shock protein" 

/protein_id=" AAC73724 .1 " 

/db_xref="GI : 1786841" 

/trans la tion= "MSKIKGNVKWFNESKGFGFITPEDGSKDVFVHFSAIQTNGFKTL 

AEGQRVE FE I TNGAKG P 3 AANVI AL t! 

complement (5496 . . 5879) 

/gene= M crcB" 

/riote=" synonym: b0624" 

complement (5496 . .5879) 

/gene="crcB" 

/ f unc t ion= " or f ; Unknown " 

/note="fl27; formerly designated ybel" 

/codon_start=l 

/ 1 r ans 1_ t able = 11 

/product="orf , hypothetical protein" 
/protein_id= " AAC73725 . 1 " 
/db_xref="GI : 1786842" 

/ trans la tion= "MLQLLLAVFIGGGTGSVARWLLSMRFNPLHQAIPLGTLTANLIG 
AFIIGIGFAWFSRMTNIDPVWKVLITTGFCGGLTTFSTFSAEWFLLQEGRFGWALLN 
VF VNLLG S F AMT ALAF WL F S AS T AH " 
5870 . . 5899 

/note="f actor Sigma70; predicted +1 start at 657188" 
complement (5945 . .5972) 

/note=" factor Sigma70; predicted +1 start at 657220" 

5972 . . 6199 

/gene="ybeH" 

/note- " synonym : bO 6 2 5 " 

5972. .6199 

/gene= f, ybeH" 

/f unction="orf ; Unknown" 

/note= n o75; 100 pet identical to YBEH_ECOLI SW: P39874" 

/codon_start=l 

/transl table=ll 

/product="orf , hypothetical protein" 
/protein id= " AAC73726 . 1 " 
/db_xref = "GI : 1786843 " 

/ 1 rans 1 a t ion= " MLVAAGQFAVTS VWEKNAE I CAS L MAQ AAEND AS L F AL P E AL LA 

RDDHDADLS VKSAQLLEGEFLGLYGEKVNVT " 

6196 . . 6759 

/gene="ybeM" 

/note=" synonym: b0626" 

6196. .6759 

/gene= "ybeM" 

/f unction="putative enzyme; Not classified" 

/note="ol87; This 187 aa ORF is 40 pet identical (33 gaps) 

to 165 residues of an approx. 312 aa protein YJM6_YEAST 
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SW: P47016" 

/codon_start=l 

/transl_table=ll 

/product = "putative amidase" 

/protein id=" AAC73727 . 1 " 

/db_xref="GI : 1786844" 

/translation= "MMTTILTIHVPSTPGRAWNMLVALQAGNIVARYAKLHLYDAFAI 
QESRRVDAGNEIAPLLEVEGMKVGLMTCYDLRFPELALAQALQGAEILVTjPAAWWGP 
LKEHHWSTLLAARALDTTCYIWAAGECGNKNIGQSRIIDPFGVTIAAASEMPALIMAE 
VTPERVRQVRAQLPVLNNRRFAPPQLL " 
6777 . .6806 

/note="f actor Sigma70; predicted +1 start at 658095" 

6888 . .7091 

/gene="ybeC" 

/note= " synonym : bO 62 7 " 

6888 . .7091 

/gene=" ybeC" 

/f unction="orf ; Unknown" 



3Ct identical to YBEc -ccO-Lii. £>w : F/iByt»' 



/ codon_start=l 
/trans l_table= 11 

/product="orf , hypothetical protein" 
/protein _id=" AAC73 72 8 . 1 " 
/db_xref ="GI : 1786845 " 

/trans 1 a t ion= " MGEI S ITKLLWAALWLLFGTKKLRTLGGDLGAAIKGFKKAMN 

DDDAAAKKGADVDLQAEKLSHKE " 

complement (7192 . . 8157) 

/gene="lipA" 

/note= " synonym : bO 62 8 " 

complement (7192 . .8157) 

/gene="lipA" 

/function= "putative enzyme; Biosynthesis of cof actors, 
carriers : Lipoate " 

/note="f321; 99 pet identical to LIPA_ECOLI SW : P25845" 

/codon_start=l 

/ 1 r ans l_t able = 11 

/product=" lipoate synthesis, sulfur insertion?" 
/protein id=" AAC73729 , 1 " 
/db_xref ="GI : 1786846 " 

/ 1 r ans 1 a t i on= " MSKP I VMERGVKYRDADKMAL I PVKNVATEREALLRKPEWMKI K 

LPADSTRIQGIKAAMRKNGLHSVCEEASCPNLAECFNHGTATFMILGAICTRRCPFCD 

VAHGRPVAPDANEPVKLAQTIADMALRYVVITSVT)RDDLRDGGAQHFAPCITAIREKS 

PQIKIETLVPDFRGRMDRALDILTATPPDVFNHNLENVPRIYRQVRPGADYNWSLKLL 

ERFKEAHPEIPTKSGLMVGLGETNEEIIEVMRDLRRHGVTMLTLGQYLQPSRHHLPVQ 

RYVSPDEFDEMKAEALAMGFTHAACGPFVRSSYHADLQAKGMEVK" 

complement (82 51 . .82 80) 

/note="f actor Sigma70; predicted +1 start at 659526" 

complement (83 66 . .9166) 

/gene=" ybeF" 

/note= " synonym : b062 9 " 

complement (83 66 . .9166) 

/gene="ybeF" 

/function= "putative regulator; Not classified" 

/note="f266; This 266 aa ORF is 99 pet identical (1 gap) 

to 266 residues of an approx. 320 aa protein YBEF_ECOLI 

SW: P30979" 

/codon_start=l 

/transl_table=ll 

/product= "putative transcriptional regulator LYSR-type" 
/protein id=" AAC73730 . 1 " 
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/db_xref ="GI : 1786847 " 

/ trans 1 a t i on= " MLNLTPSAI SQS IQKLRVI FPDPLFIRKGQGVTPTAFAMHLHE Y 
I SQGLES ILGALD I EGS YDKQRTITI ATTPS VGALVLPVI YRAI KTHYPQLLLRNPPI 
SDAENQLSQFQTDLIIDNMFCTNRTVQHHVLFTDNMVLICREGNPLLSLEDDRETIDN 
AAHVLLLPEEQNFSGLRQRVQEMFPDRQINFTSYNILTIAALVANSDMLAIIPSRFYN 
LFSRCWPLEKLPFPSLNEEQIDFSIHYNKFSLRDPILHGVIDVIRNAF" 

gene complement (9578 . .10153) 

/gene="lipB " 
/ no te=" synonym: b063 0" 

CDS complement (9578 . .10153) 

/gene="lipB M 

/f unction=" enzyme; Biosynthesis of cofactors, carriers: 
Lipoate" 

/note="fl91; 99 pet identical to LIPB_ECOLI SW: P30976" 

/codon_start=l 

/transl table=ll 

/product= "protein of lipoate biosynthesis" 
/protein_id= " AAC73731 . 1 " 
/db_xref = !! GI : 1786848 " 

/translation^' MHE FTDTRDD S TLDE I WL VEHYPVFTQGQAGKAEH I LMPGD I P V 
IQSDRGGQVTYHGPGQQVMYVLLNLKRRKLGVRELVTLLEQTVVNTLAELGIEAHPRA 
DAPGVYVGEKKICSLGLRIRRGCSFHGLALNVNMDLSPFLRINPCGYAGMEMAKISQW 
KPEATTNNIAPRLLENILALLNNPDFEYITA" 

BASE COUNT 2773 a 2298 c 2547 g 2646 t 

ORIGIN 

1 tttaatgatt ttaagttttt taattaatgt aattacgaaa tgactcgcag gtttaagtga 
61 tttaattgat ttaatgaata aaatttgeca cgatcataat taatatctat gtattttgat 
121 tcaacatttt aattacatcc gtcaaagagg ctegggacaa cccgcaagga aaacaatgtt 
181 gcagcttaac gagaataaac agtttgcatt tttccaaaga ctggcatttc egctgegtat 
241 etttttgetg attctggtgt tctcaatatt tgtcattgea gccctggcgc aatattttac 
301 ggccagtttt gaggactatt taaegcttea tgtacgegae atggcaatga ateaggegaa 
361 aattattgee tccaatgaca gtgtcatctc tgcggtgaaa aegegtgact acaaaegget 
421 ggcgaccatc gctaacaaat tacaaagaga taccgatttt gattatgtgg tgattgggga 
481 ccggcactcg atccgccttt accatcctaa teeggagaaa attggttatc etatgeagtt 
541 caccaaacag ggcgcgctgg agaaagggga gagctacttc attaceggga aagggtcaat 
601 ggggatggcg atgcgcgcca aaacgccaat ctttgatgac gatggaaaag teateggegt 
661 ggtgtcgatt ggctacctgg tgagtaaaat cgatagctgg egggctgagt ttttattacc 
721 gatggcaggt gtgtttgtcg tgctgttagg gattctgatg ttgctgtcgt ggttcctggc 
781 cgcgcatatc egteggcaga tgatgggcat ggagccaaag caaatcgcac gcgtggtccg 
841 tcagcaagag gcgctgttta gttcggttta tgaagggctg attgcggtgg ateegcatgg 
901 ttacattacc gccatcaatc gtaacgcaag aaagatgctg gggctgagtt cccccggacg 
961 gcaatggttg ggtaaaccca ttgttgaagt ggtcaggccc gecgatttet ttaccgaaca 
1021 gattgatgaa aaaegtcagg atgtggtggc gaactttaac ggtctgagcg ttattgccaa 
1081 cegggaaget attcgttcag gtgatgattt getgggggee attatcagct ttcgtagtaa 
1141 agacgaaatt tccaccctca atgegcaact gaegcaaata aaacaatacg ttgagagect 
1201 tegtacattg cgacacgagc atctcaattg gatgtcgacg ctcaatggtc tgttgcagat 
1261 gaaagagtat gatcgcgtgc tggcgatggt gcagggggag tctcaggccc agcaacagct 
1321 tattgacagc ctgegegagg cgtttgccga tegecaggtg gcggggctgc tttttggtaa 
1381 agtgcagcgc gcccgggaac tggggctaaa aatgatcatt gtccccggta gccagctttc 
1441 gcaactgccg ccaggactgg atagcaccga gtttgeagee attgtgggca atttacttga 
1501 taacgccttc gaagccagcc tgegtagega tgaaggaaac aagatcgttg aattattcct 
1561 cagegatgaa ggcgatgatg tggtgattga agtcgecgat cagggctgeg gcgttccaga 
1621 gtctctacga gacaaaatat ttgagcaggg ggtcagtacg cgtgctgacg agcccggtga 
1681 acatggcatt gggttgtact tgattgccag etaegtaacg cgctgcggtg gtgttatcac 
1741 tctcgaagat aatgatccct geggtacett attttcaatc tatattccga aagtgaaacc 
1801 taatgacagc tccattaacc ctattgatcg ttgaggacga aacgccgctg geagagatge 
1861 atgeggaata tattegtcac attcceggat tcagtcagat attactggcg ggaaatctgg 
1921 cgcaggcccg aatgatgatc gagegtttta ageegggget aatcttgetc gataactatc 
1981 ttcctgaegg tagagggatt aatttactgc atgaactggt geaggegcat tatcccggcg 
2041 acgtggtgtt taccactgca gecagegata tggaaacggt gtctgaagee gtacgttgtg 
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2101 gtgtatttga ttatctcatt aagcccattg 
2161 gtttccgcca gcgtaaacat atgctggaaa 
2221 atgagatgtt taatgcttat gctcgcggtg 
2281 accccttgac gctaaacgcg gtgcgaaaac 
2341 cagaaacggt ggcgcaggca ctgaccatca 
2401 attgcgccag ccgccatctg attattgctg 
2461 aacgcatata ccacagtggg tgacatgata 
2521 tatttacttg cctgtgaccg ctgctgccgt 
2581 aacaatcacc aaaccaacaa gcaccggtac 
2641 gatcttcgcc atcccggcaa ccgcaacgac 
2701 aaggtttgac gcctgcagca tcgggatagt 
2761 cgccagtttc gggatcatct caacaaacgc 
2821 gactgccgcc agcattgtca gaatcaccaa 
2881 aaacgaggta gcgatagaaa tcagactttg 
2941 gaatacccca gcggcaacca gcagcatcac 
3001 ataagccact tccagaccag agaaaacttt 
3061 aatggaggca atcagcatac aaatcaccag 
3121 tttaccgtca aaaatcagta caccgatgat 
3181 aggagcagtg gtggtgattt cactgacatc 
3241 atccagataa cgttgccaga agaagtgggc 
3301 gataggcagc gtcgttttga aggcgaagtc 
3361 cagcaccaca tcccctgaag tcggtgcgag 
3421 agcagctgcg ccacgactga taccaacgtt 
3481 caaaacaccc agaccggttg cggaagagac 
3541 ggcggcaatc atcagcaggt aaggggagtt 
3601 gaccaccata tcattcgcgc cgatatgggt 
3661 aatcatcatg ccgaggtcgc cgccgcggct 
3721 atccgtggcg ctgtagcctg ttgaagcctg 
3781 ggcactgata atcagcaata acaggccacc 

3 841 tttaatgatg tagcgagcta cacccacaat 
3901 tgtcagcata atttttcctg tctccaggcc 
3961 ttaaggaaaa aatatgccca ataaattggc 
4021 gccattaaga cattgaagtt gctgttttta 
4081 ttggcaatta tcacataata atgtgtgcaa 
4141 ttgcaatggt gttagctgcg gctgtgcgct 
4201 acgatatata ttatatactt cgttaagatg 
4261 tttttccttg attgcatttt gtcatcaaaa 

4 321 catcgcagaa aacgacgcat catctttaat 
43 81 cggaaataaa tagagcagct attcagatta 
4441 taattgtagc tttgctatgc tagtagtaga 
4501 aacgtgagta aatatgtcgc tatcttttcc 
4561 aaagtttttg ctaacgcaga tgagtggatg 
4621 tggcaacagc ctgaacatta tgatttatat 
4681 gcttacgaca aagaaaaaac cgatcgctat 
4741 ctgtcgcgtt gggatgaaaa aggaaactgg 
4 801 tcgtggaaca aatgggaacc gattgccgga 
4 861 gcggatgaaa attttcattt aggtctggga 
4921 tggaattaca tccctctccc ggttctactg 
4981 acttttcaga tgacctacat tccgggtacc 
5041 atgcgctttc agttttgaga caaatgaagt 
5101 aataagtaac atcaaaaata acgcgacttt 
5161 aagcgtacca caattggtgt actggtaacc 
5221 aaggtaattt tgatgtctaa gattaaaggt 
5281 ttcggtttca ttactccgga agacggcagc 
5341 cagactaatg gttttaaaac tcttgctgaa 
5401 ggtgccaaag gcccttctgc tgcaaacgta 
5461 caaaacccgc ttaatcagcg ggtttttttt 
5521 agccagaatg ccagtgcggt catggcaaaa 
5581 aatgcccagc caaagcggcc ctcttgtaac 
5641 aaggttgtta gaccgccaca aaatccggtg 



cctatgaacg gctggggcaa acgctaaccc 
gtattgatag cgccagccag aagcaaattg 
aacctaagga cgagctaccg accggcattg 
tgtttaaaga gcctggtgtg caacatacgg 
gccgcaccac tgccaggcgt tatcttgaat 
aaattgttca cggcaaagtt ggcagaccac 
aagccggagg aaacttgcct ccggcattac 
tcctggcacc atcagctctg tagcaacgat 
cgaggtgcgt tttacgactt caaacggcga 
tacgccagaa acgggcgaaa gggtacggcc 
caaatacgcc gggttaatgc cggaagagtg 
ataaaacggc gcattgcctg aaccggtcgt 
taccagcatc aggatgatac tcgccgaacc 
aataaagccg atggtgctaa gcccctgagc 
cacgttagca aacgcatctg ccatcccgcg 
ctgggtatta aagctgcgga ggaactccag 
aatagtgatg atgtgtaatt gcggacccca 
cggcgtgaac ggcaaaatgg cataaaacgc 
taacatttca tgagagatgt gctctttttt 
gatcgccatg ccgataattg cagcaattga 
aatcagcgac atttcggaag cttgcgccgc 
aataatcgcc gccggggagg cacaaatggc 
taccatcacc ggaaataggg ttgccatcag 
ggccagagac atcagacagg cgacaaaata 
aatatactgc aatggttttg acgccagctt 
catgtaagcg gcaaatccac acagcatcat 
cattagtaat attttaacgt attcaacgat 
gctggacggt aacactttgt gccccataat 
gacaaataac acaccagtgg cggaataccc 
aaccacaacc ccaataagga gctcaatgaa 
ccaaagtaaa taataaaaaa ttcctaaagc 
gatgaatgct gattaaaatc aagaaaaact 
tacatagata acaaaacgca tatttagcgg 
attgatggtg gattgatgca aatttgttaa 
caaaaaataa tctaatatga gcataggttg 
attgttgtat ctcgttaaaa aataaaataa 
aagacttggt ttttcttttt tgactattcc 
cgatgcgcgg aaatatttaa cttgaacaag 
ttctttatgt tgggtctatt aaggttatgt 
tttttgataa atgttttatg gtcacaaatg 
tttgttttta ttcagttaat cagcgttggt 
acaacgttta gagaaaatat tgcacaaacc 
attcctgcca tcacctggca tgcacgtttc 
aacgagcgac cgtggggtgg cggttttggc 
catggcctgt atgccatggc atttaaggac 
tacggatggg aaagtacctg gcgaccgctg 
ttcaccgctg gcgtaacggc acgcgataac 
ccattggcct ccgtgggtta tggcccagtg 
tacaacaatg gcaatgtgta ctttgcctgg 
tttagtaact tctttaaaat caatagctaa 
tatcactttt tagtaaagtt acactggaca 
gacacagcat ttgtgtctat ttttcatgta 
aacgttaagt ggtttaatga gtccaaagga 
aaagacgtgt tcgtacactt ctctgcaatc 
ggtcagcgcg tagagttcga aatcactaac 
atcgctctgt aagatacgtc agcaagaatt 
ggtctttagt gtgcggttga ggccgaaaac 
gaccccagaa ggttgacgaa aacgttcagt 
aaaaacacca cttctgccga aaatgttgag 
gtgattaata ctttccacac tggatcaatg 
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5701 ttcgtcatcc tgctgaacca tgcgaatcct 
5761 gctgtcagcg tccccaacgg aatcgcctga 
5821 catctcgcca cgcttcccgt accaccgcca 
5881 gcaaaatcct gctatttgat ttgtatgagt 
5941 gcgaatttta cgactctgac aggaggtggc 
6001 tacatctgtg tgggaaaaga acgctgagat 
6061 aaacgacgca tcgctgtttg ccctgccgga 
6121 agatctatcg gttaaatcag cacagctgct 
6181 gaaagtaaac gtaacatgat gacgacaatt 
6241 cgcgcatgga atatgctggt ggcacttcag 
6301 ctgcatctct atgatgcatt tgccattcag 
6361 atcgctccgt tactggaggt ggaagggatg 
6421 cgctttccag agctggcgct ggcacaggca 
6481 gccgcctggg ttcgcgggcc gctcaaagag 
6541 gcgctggata ccacctgtta tatggtggcg 
6601 caaagccgga ttatagatcc ctttggcgtc 
6661 ctcattatgg cggaagtgac gcccgaacgt 
6721 ttaaacaacc gtcgctttgc gccgccgcaa 
6781 att.cacc.ttg ttacagattg . ctattgtgtg 
6841 gtttttaagg cgcgttctgt tgccggttat 
6901 gtattaccaa actgctggta gttgcggcgc 
6961 tacgtacgct gggcggagac cttggagcgg 
7021 atgacgatgc tgcggcgaaa aaaggcgcag 
7081 ataaagagtg acgtggcgag caggacgctc 
7141 gcttcaaaaa gcgggttttt tatcagacag 
7201 ttccatccct ttcgcctgca aatcggcgtg 
7261 agcatgggta aagcccatcg ccagcgcttc 
7321 aacgtaacgt tgaaccggca ggtgatggcg 
7381 catcgtcaca ccatgacggc gcaggtcgcg 
7441 acccagtccc accatcagac cagacttggt 
7501 ttccagcagc ttcagcgacc agttgtaatc 
7561 cggtacgttt tccaggttat ggttgaacac 
7621 agcacgatcc atacgaccgc ggaaatccgg 
7681 tttttcccga atggcagtaa tgcaatccgc 
7741 atcacggtca acggaggtga taaccacata 
7801 cagtttcact ggttcattgg catcaggagc 
7861 cggacaacgg cgggtacaaa tagcgccgag 
7921 gcattccgcc aggttagggc aggaggcttc 
7981 cattgcggct ttgatgccct ggatacgtgt 
8041 ttccggcttg cgcagcaggg cttcgcgctc 
8101 catcttatcg gcatcgcggt atttaacacc 
8161 gtgcgtgttc cagttgcgaa ttacgaagga 
8221 tatcaactat ttttgaatta acgactggca 
82 81 caaagcggtc gcgaaaatgt aaaattgttg 
8341 gtgatctggc tgtaacaata ctttttcaaa 
8401 gcaggatggg atcacgcagg ctgaatttgt 
8461 catttaagga cggaaaaggc agtttttcca 
8521 aacggctggg aataatcgct aacatgtcac 
8581 tgttgtagct ggtgaaattg atttgccggt 
8641 gaccgctgaa attttgctct tccggtaaca 
8701 ctctgtcatc ttctaaagag agtagtggat 
8761 cggtgaacag aacatgatgt tgcaccgtac 
8821 gatcggtttg aaactgactg agttggtttt 
8881 aaagctgcgg atagtgagtt ttaatcgccc 
8941 ccgagggagt ggtcgcaata gttatcgttc 
9001 gcgcgccaag aatggactca aggccctgac 
9061 cggtaggagt gacaccctgg cctttgcgaa 
9121 gtttctgaat agactgactg attgccgagg 
9181 cgatcccttt atgtacatat acagcttcaa 
9241 gtaaggttgt aaatatttgt ggtttacctt 



attcctatga tgaatgcccc aatcaggttt 
tgcagtgggt taaatcgcat acttaacagc 
ataaaaactg ctaaaagaag ttgtaacact 
gataagtgta acgccgaata atcgtcgttg 
aatgctggtt gccgcaggac agtttgctgt 
ttgtgcctcg ttgatggcgc aggcggcgga 
agcattgctg gcgcgcgatg atcatgatgc 
ggaaggcgaa ttcctcggac tttacggcga 
ctgacgattc atgttccttc aacgccgggg 
gcaggaaaca tcgtcgcccg ttatgccaaa 
gaatcacgcc gtgttgatgc tggtaatgaa 
aaggtcggtc tgatgacctg ttatgactta 
ttacagggag ctgaaatcct ggtacttcct 
catcactggt caacgttgct tgccgctcgt 
gcgggggagt gcgggaacaa aaatatcggt 
accattgcgg cagcgtcaga aatgcctgca 
gtgcgtcagg tgcgcgcgca actgcccgtc 
ttattatgat gtttttttac tcggcgcttg 
cgcgcgtcga atgaccgtta atattctctg 
atgtcaagaa ggtatctatg ggtgagatta 
tggtcgttct gctgtttggg actaagaagt 
ccattaaagg gttcaagaag gcgatgaatg 
acgttgatct tcaggctgaa aagctctctc 
cctcaatatc ttgttcgata caaaaaaccc 
atgtaagtaa ttattacagg attacttaac 
gtaagaagag cggacaaacg gaccgcatgc 
ggctttcatt tcgtcgaact catccgggct 
gcttggctgc aaatattgcc ccagcgttaa 
cattacctca ataatttctt cattggtttc 
cgggatttcc ggatgcgctt ctttaaagcg 
tgcaccaggc cgtacctgac ggtaaatacg 
atctggtggc gttgcagtca gaatatccag 
caccagagtt tcaattttga tttgcgggct 
aaagtgctgg gcaccgccat cgcgcaggtc 
acgcagcgcc atatcggcaa tggtctgcgc 
taccgggcga ccgtgggcaa catcacagaa 
gatcataaac gttgctgtgc cgtggttgaa 
ctcgcagaca gaatgcaggc catttttgcg 
agagtccgct ggaagcttga ttttcatcca 
tgttgccacg tttttaaccg ggataagggc 
gcgttccatc acaatgggtt tactcatagc 
aagcgtttca attcaataga ttgttgtaat 
gtatatcatt gaaacggacc tgaaagcagc 
ttggattgtg ccattttatc gttctgcgct 
atgcattacg gataacatct attactccgt 
tgtagtggat agagaaatct atttgctcct 
gcggccagca gcggctaaac aggttataaa 
tgttggcaac cagtgcagcg attgtcaaaa 
ccggaaacat ttcttgaact ctctgccgca 
gaagtacatg cgcagcgttg tcgatagtct 
ttccctcgcg gcaaattaac accatattgt 
gattggtgca aaacatgtta tcgatgatga 
ccgcgtcgct gatgggcggg ttgcgcagta 
gatagatgac aggaaggacc agtgctccga 
gttgcttatc atagcttcct tcgatatcca 
tgatatactc atgtagatgc atcgcaaatg 
taaacaatgg gtcagggaat ataacgcgca 
gggtcagatt aagcactttc gctgcattaa 
aaatagtcag aaggttaaga tcaatatttc 
ctgacgactt tctgcttaaa cagggttcaa 
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9301 tttgattatt actatccacg cacttactcc 
9361 aaattacaat aatcttatgt ctaattggaa 
9421 tttcaatagg ttaatcaatg gggtgagtta 
9481 agtgataaag aaaatataca aacgcgaagg 
9541 aatgacccag tgtaaattgg gccattgatg 
9601 ccggattgtt tagtagcgct aaaatatttt 
9661 tcgcttcggg tttccattgt gatattttag 
9721 taatacgtaa aaatggtgaa agatccatat 
9781 aaccgcgtcg aatacgtaaa cccagtgagc 
9841 gcgcgtcagc ccgaggatgc gcttctatac 
9901 gctcaagcaa ggtcaccagt tcacgcacac 
9961 catacatcac ctgttgcccc ggcccgtgat 
10021 tcaccggaat atcacccggc attaaaatgt 
10081 ataccggata gtgctcgacc agccagattt 
10141 tgaattcatg catagcctgg gagattggct 
10201 gaattttatc ctgatacaaa acggcatctc 
10261 tata 

// 



aattttattc atggaaaaat aatatttaaa 
cggaacgctt ttgctcacca taatcaacta 
cggagcaagg cgtggcaccg catttctgca 
agatgtaaag cattagcaga tattatgagt 
tatggaatta agcggtaata tattcgaagt 
ccagtaaacg tggagcaata ttattagtcg 
ccatttccat tccggcatac ccacaaggat 
tgacgtttaa tgccagaccg tggaatgaac 
aaattttctt ttccccaaca tagacacctg 
ccagttcagc cagggtattc accactgttt 
cgagtttacg gcgtttcagg ttaagcaaca 
aagtcacctg cccaccgcga tcgctctgga 
gctccgcttt tcctgcctga ccttgggtga 
catcaagggt actatcatcg cgggtatcgg 
cgtaaggctg aagaccgagc tggcggacaa 
cgtggagaat gagtaaaagt gggggaaaag 
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